Restructuring exponential family mixture models

نویسندگان

  • Pierre L. Dognin
  • John R. Hershey
  • Vaibhava Goel
  • Peder A. Olsen
چکیده

Variational KL (varKL) divergence minimization was previously applied to restructuring acoustic models (AMs) using Gaussian mixture models by reducing their size while preserving their accuracy. In this paper, we derive a related varKL for exponential family mixture models (EMMs) and test its accuracy using the weighted local maximum likelihood agglomerative clustering technique. Minimizing varKL between a reference and a restructured AM led previously to the variational expectation maximization (varEM) algorithm; which we extend to EMMs. We present results on a clustering task using AMs trained on 50 hrs of Broadcast News (BN). EMMs are trained on fMMI-PLP features combined with frame level phone posterior probabilities given by the recently introduced sparse representation phone identification process. As we reduce model size, we test the word error rate using the standard BN test set and compare with baseline models of the same size, trained directly from data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Variational Bayesian Dirichlet-Multinomial Allocation for Exponential Family Mixtures

We study a Bayesian framework for density modeling with mixture of exponential family distributions. Our contributions: •A variational Bayesian solution for finite mixture models • Show that finite mixture models (with a Bayesian setting) can determine the mixture number automatically • Justify this result with connections to Dirichlet Process mixture models •A fast variational Bayesian solutio...

متن کامل

Mixture Models

Consider the task of summarizing the data in Figure 1. A common technique for performing this task is to use a statistical model known as a mixture model. Relative to many other models for estimating densities, mixture models have a number of advantages. First, mixture models can summarize data that contain multiple modes. In this sense, they are more powerful than distributions from the expone...

متن کامل

Exponential-Family Random Graph Models with Time Varying Network Parameters

Dynamic networks are a general language for describing time-evolving complex systems, and have long been an interesting research area. It is a fundamental research question to model time varying network parameters. However, due to difficulties in modeling functional network parameters, there is little progress in the current literature to effectively model time varying network parameters. In th...

متن کامل

Small-Variance Asymptotics for Exponential Family Dirichlet Process Mixture Models

Sampling and variational inference techniques are two standard methods for inference in probabilistic models, but for many problems, neither approach scales effectively to large-scale data. An alternative is to relax the probabilistic model into a non-probabilistic formulation which has a scalable associated algorithm. This can often be fulfilled by performing small-variance asymptotics, i.e., ...

متن کامل

A generalized F mixture model for cure rate estimation.

Cure rate estimation is an important issue in clinical trials for diseases such as lymphoma and breast cancer and mixture models are the main statistical methods. In the last decade, mixture models under different distributions, such as exponential, Weibull, log-normal and Gompertz, have been discussed and used. However, these models involve stronger distributional assumptions than is desirable...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010